4,183 research outputs found

    MDP Optimal Control under Temporal Logic Constraints

    Get PDF
    In this paper, we develop a method to automatically generate a control policy for a dynamical system modeled as a Markov Decision Process (MDP). The control specification is given as a Linear Temporal Logic (LTL) formula over a set of propositions defined on the states of the MDP. We synthesize a control policy such that the MDP satisfies the given specification almost surely, if such a policy exists. In addition, we designate an "optimizing proposition" to be repeatedly satisfied, and we formulate a novel optimization criterion in terms of minimizing the expected cost in between satisfactions of this proposition. We propose a sufficient condition for a policy to be optimal, and develop a dynamic programming algorithm that synthesizes a policy that is optimal under some conditions, and sub-optimal otherwise. This problem is motivated by robotic applications requiring persistent tasks, such as environmental monitoring or data gathering, to be performed.Comment: Technical report accompanying the CDC2011 submissio

    LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees

    Full text link
    We present a method to generate a robot control strategy that maximizes the probability to accomplish a task. The task is given as a Linear Temporal Logic (LTL) formula over a set of properties that can be satisfied at the regions of a partitioned environment. We assume that the probabilities with which the properties are satisfied at the regions are known, and the robot can determine the truth value of a proposition only at the current region. Motivated by several results on partitioned-based abstractions, we assume that the motion is performed on a graph. To account for noisy sensors and actuators, we assume that a control action enables several transitions with known probabilities. We show that this problem can be reduced to the problem of generating a control policy for a Markov Decision Process (MDP) such that the probability of satisfying an LTL formula over its states is maximized. We provide a complete solution for the latter problem that builds on existing results from probabilistic model checking. We include an illustrative case study.Comment: Technical Report accompanying IFAC 201

    Probabilistically safe vehicle control in a hostile environment

    Full text link
    In this paper we present an approach to control a vehicle in a hostile environment with static obstacles and moving adversaries. The vehicle is required to satisfy a mission objective expressed as a temporal logic specification over a set of properties satisfied at regions of a partitioned environment. We model the movements of adversaries in between regions of the environment as Poisson processes. Furthermore, we assume that the time it takes for the vehicle to traverse in between two facets of each region is exponentially distributed, and we obtain the rate of this exponential distribution from a simulator of the environment. We capture the motion of the vehicle and the vehicle updates of adversaries distributions as a Markov Decision Process. Using tools in Probabilistic Computational Tree Logic, we find a control strategy for the vehicle that maximizes the probability of accomplishing the mission objective. We demonstrate our approach with illustrative case studies

    Optimality and robustness in multi-robot path planning with temporal logic constraints

    Full text link
    In this paper we present a method for automatically generating optimal robot paths satisfying high-level mission specifications. The motion of the robot in the environment is modeled as a weighted transition system. The mission is specified by an arbitrary linear temporal-logic (LTL) formula over propositions satisfied at the regions of a partitioned environment. The mission specification contains an optimizing proposition, which must be repeatedly satisfied. The cost function that we seek to minimize is the maximum time between satisfying instances of the optimizing proposition. For every environment model, and for every formula, our method computes a robot path that minimizes the cost function. The problem is motivated by applications in robotic monitoring and data-gathering. In this setting, the optimizing proposition is satisfied at all locations where data can be uploaded, and the LTL formula specifies a complex data-collection mission. Our method utilizes Büchi automata to produce an automaton (which can be thought of as a graph) whose runs satisfy the temporal-logic specification. We then present a graph algorithm that computes a run corresponding to the optimal robot path. We present an implementation for a robot performing data collection in a road-network platform.This work was supported in part by the Office of Naval Research (grant number MURI N00014-09-1051), Army Research Office (grant number W911NF-09-1-0088), Air Force Office of Scientific Research (grant number YIP FA9550-09-1-020), National Science Foundation (grant number CNS-0834260), Singapore-MIT Alliance for Research and Technology (SMART) Future of Urban Mobility Project and by Natural Sciences and Engineering Research Council of Canada. (MURI N00014-09-1051 - Office of Naval Research; W911NF-09-1-0088 - Army Research Office; YIP FA9550-09-1-020 - Air Force Office of Scientific Research; CNS-0834260 - National Science Foundation; Singapore-MIT Alliance for Research and Technology (SMART); Natural Sciences and Engineering Research Council of Canada

    Truncated atomic plane wave method for the subband structure calculations of Moir\'e systems

    Full text link
    We propose a highly efficient and accurate numerical scheme named Truncated Atomic Plane Wave (TAPW) method to determine the subband structure of Twisted Bilayer Graphene (TBG) inspired by BM model. Our method utilizes real space information of carbon atoms in the moir\'e unit cell and projects the full tight binding Hamiltonian into a much smaller subspace using atomic plane waves. We present accurate electronic band structures of TBG in a wide range of twist angles together with detailed moir\'e potential and screened Coulomb interaction at the first magic angle using our new method. Furthermore, we generalize our formalism to solve the problem of low frequency moir\'e phonons in TBG

    Optimal multi-robot path planning with temporal logic constraints

    Get PDF
    In this paper we present a method for automatically planning optimal paths for a group of robots that satisfy a common high level mission specification. Each robot's motion in the environment is modeled as a weighted transition system. The mission is given as a Linear Temporal Logic formula. In addition, an optimizing proposition must repeatedly be satisfied. The goal is to minimize the maximum time between satisfying instances of the optimizing proposition. Our method is guaranteed to compute an optimal set of robot paths. We utilize a timed automaton representation in order to capture the relative position of the robots in the environment. We then obtain a bisimulation of this timed automaton as a finite transition system that captures the joint behavior of the robots and apply our earlier algorithm for the single robot case to optimize the group motion. We present a simulation of a persistent monitoring task in a road network environment.United States. Office of Naval Research. Multidisciplinary University Research Initiative (N00014-09-1051)United States. Army Research Office (W911NF-09-1-0088)United States. Air Force Office of Scientific Research (YIP FA9550-09-1-020)National Science Foundation (U.S.). (CNS- 0834260

    Realization of SOC behavior in a dc glow discharge plasma

    Full text link
    Experimental observations consistent with Self Organized Criticality (SOC) have been obtained in the electrostatic floating potential fluctuations of a dc glow discharge plasma. Power spectrum exhibits a power law which is compatible with the requirement for SOC systems. Also the estimated value of the Hurst exponent (self similarity parameter), H being greater than 0.5, along with an algebraic decay of the autocorrelation function, indicate the presence of temporal long-range correlations, as may be expected from SOC dynamics. This type of observations in our opinion has been reported for the first time in a glow discharge system.Comment: Key Words: Glow discharge; Self Organized Criticality; Hurst exponent; R/S technique; Power spectrum; Autocorrelation function; Nongaussian probability distribution function. Phys Lett A (article in Press

    Least Squares Temporal Difference Actor-Critic Methods with Applications to Robot Motion Control

    Get PDF
    We consider the problem of finding a control policy for a Markov Decision Process (MDP) to maximize the probability of reaching some states while avoiding some other states. This problem is motivated by applications in robotics, where such problems naturally arise when probabilistic models of robot motion are required to satisfy temporal logic task specifications. We transform this problem into a Stochastic Shortest Path (SSP) problem and develop a new approximate dynamic programming algorithm to solve it. This algorithm is of the actor-critic type and uses a least-square temporal difference learning method. It operates on sample paths of the system and optimizes the policy within a pre-specified class parameterized by a parsimonious set of parameters. We show its convergence to a policy corresponding to a stationary point in the parameters' space. Simulation results confirm the effectiveness of the proposed solution.Comment: Technical report accompanying an accepted paper to CDC 201

    Multi-UAV Convoy Protection: An Optimal Approach to Path Planning and Coordination

    Get PDF
    (c) 2010 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.Digital Object Identifier : 10.1109/TRO.2010.2042325In this paper we study the problem of controlling a group of Unmanned Aerial Vehicles (UAVs) to provide convoy protection to a group of ground vehicles. The UAVs are modeled as Dubins vehicles flying at a constant altitude with bounded turning radius. We first present time-optimal paths for providing convoy protection to stationary ground vehicles. Then we propose a control strategy to provide convoy protection to ground vehicles moving on straight lines. The minimum number of UAVs required to provide perpetual convoy protection in both cases are derived

    Optimal Motion Primitives for Multi-UAV Convoy Protection

    Get PDF
    (c) 2010 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.Digital Object Identifier : 10.1109/ROBOT.2010.5509221In this paper we study the problem of controlling a number of Unmanned Aerial Vehicles (UAVs) to provide convoy protection to a group of ground vehicles. The UAVs are modeled as Dubins vehicles flying at a constant altitude with bounded turning radius. This paper first presents time-optimal paths for providing convoy protection to static ground vehicles. Then this paper addresses paths and control strategies to provide convoy protection to ground vehicles moving on a straight line. Minimum numbers of UAVs required to provide perpetual convoy protection for both cases are derived
    • …
    corecore